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1 Devirtualizable virtual machines enabling general, single-node, online 
maintenance 

David E. Lowell, Yasushi Saito, Eileen J. Samberg 

October 2004 Proceedings of the 11th international conference on 

Architectural support for programming languages and operating 

systems, Volume 32 , 38 , 39 Issue 5,5,11 
Full text available: pdf(1 74.01 KB) Additional Information: full citation , abstract , references , index terms 

Maintenance is the dominant source of downtime at high availability sites. 
Unfortunately, the dominant mechanism for reducing this downtime, cluster rolling 
upgrade, has two shortcomings that have prevented its broad acceptance. First, 
cluster-style maintenance over many nodes is typically performed a few nodes at a 
time, mak-ing maintenance slow and often impractical. Second, cluster-style 
maintenance does not work on single-node systems, despite the fact that their 
unavailability during mainte ... 

Keywords: availability, online maintenance, planned downtime, virtual machines 



Virtual [machjn^ 
computing 

Tal Garfinkel, Ben Pfaff, Jim Chow, Mendel Rosenblum, Dan Boneh 
October 2003 Proceedings of the nineteenth ACM symposium on Operating 
systems principles 

Full text available* f§ pdf(140 31 KB) Additional Information: fuN.crtation, abstract, references, citings, index 
^ ' terms 

We present a flexible architecture for trusted computing, called Terra, that allows 
applications with a wide range of security requirements to run simultaneously on 
commodity hardware. Applications on Terra enjoy the semantics of running on a 
separate, dedicated, tamper-resistant hardware platform, while retaining the 
ability to run side-by-side with normal applications on a general-purpose 
computing platform. Terra achieves this synthesis by use of a trusted virtual 
machine monitor (TVMM ... 



Keywords: VMM, attestation, authentication, trusted computing, virtual machine, 
virtual machine monitor 
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An implementation scheme for a virtual machine monitor to be realized on user Q 
- microprogrammable minicomputers 

B. D. Shriver, J. W. Anderson, L. J. Waguespack, D. M. Hyams, R. A. Bombet 
October 1976 Proceedings of the annual conference 

Full text available: Wl pdf(654.60 KB) Additional Information: testation, abstract, references, citings, index 
^ '* terms 

A virtual machine monitor allows several different operating systems to run 
concurrently on the same machine. This paper presents the description of a virtual 
machine monitor and its support structure which can be implemented on a 
microprogrammable minicomputer or a distributed network of such machines. In 
our approach, all storage, transformational, input, and output resources of the 
system are accessed through a mapping mechanism. The design and 
implementation methodology for an actual re ... 

Helper threads via virtual multithreading on an experimental itanium® 2 
processor-based platform 

Perry H. Wang, Jamison D. Collins, Hong Wang, Dongkeun Kim, Bill Greene, Kai-Ming 
Chan, Aamir B. Yunus, Terry Sych, Stephen F. Moore, John P. Shen 
October 2004 Proceedings of the 11th international conference on 

Architectural support for programming languages and operating 

systems, Volume 39 , 38 , 32 Issue 11,5,5 
Full text available: * ^ pdf(225.47 KB) Additional Information: full citation , abstract , references , index terms 

Helper threading is a technology to accelerate a program by exploiting a 
processor's multithreading capability to run 1 "assist" threads. Previous 
experiments on hyper-threaded processors have demonstrated significant 
speedups by using helper threads to prefetch hard-to-predict delinquent data 
accesses. In order to apply this technique to processors that do not have built-in 
hardware support for multithreading, we introduce virtual multithreading (VMT), a 
novel form of switch-on-event user-level ... 

Keywords: DB2 database, PAL, cache miss prefetching, helper thread, itanium 
processor, multithreading, switch-on-event 



5 EASY— an operating 
Charles w7FMnk 

September 1977 ACM SI G MICRO Newsletter , Proceedings of the 10th annual 

workshop on Microprogramming, Volume 8 Issue 3 
Full text available* "j||pdf(733 1 9 KB) Additiona! Information: full citation , abstract , references , citings , index 

The Emulation Aid SYstem is a virtual machine monitor for the Nanodata QM-1 
microprogrammable computer. The system is designed to provide the user with an 
interactive interface for the development and subsequent use of emulations on the 
QM-1. EASY provides integrated support for: 1) interactive control of multiple, 
concurrently resident, virtual computers implemented via emulation, 2) 
input/output from emulations (virtual I/O) to the various real peripherals of the 
QM-1, and 3) diagnostic d ... 

Keywords: Emulation, Intermediate language machines, Microprogramming, 
Nanodata QM-1, Software engineering, Virtual machine monitors, Virtual machines 
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7 Archjtectu^ 
R. P. Goldberg 

March 1973 Proceedings of the workshop on virtual computer systems 
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terms 



In this paper we develop a model which represents the addressing of resources by 
processes executing on a virtual machine. The model distinguishes two maps: the 
0-map which represents the map visible to the operating system software running 
on the virtual machine, and the f-map which is invisible to that software but which 
is manipulated by the virtual machine monitor running on the real machine. The 
0-map maps process names into resource names and the f-map maps virtual resou 



8 Cellular disco: resource management using virtual clusters on shared-memory Q 
multiprocessors 

Kinshuk Govil, Dan Teodosiu, Yongqiang Huang, Mendel Rosenblum 

August 2000 ACM Transactions on Computer Systems (TOCS), Volume 18 Issue 3 



Despite the fact that large-scale shared-memory multiprocessors have been 
commercially available for several years, system software that fully utilizes all 
their features is still not available, mostly due to the complexity and cost of 
making the required changes to the operating system. A recently proposed 
approach, called Disco, substantially reduces this development cost by using a 
virtual machine monitor that laverages the existing operating system technology. 
In this paper we present a ... 

Keywords: fault containment, resource managment, scalable multiprocessors, 
virtual machines 



9 Cellular Disco: resource management using virtual clusters on shared-memory Q 
multiprocessors 

Kinshuk Govil, Dan Teodosiu, Yongqiang Huang, Mendel Rosenblum 

December 1999 ACM SIGOPS Operating Systems Review , Proceedings of the 



Despite the fact that large-scale shared-memory multiprocessors have been 
commercially available for several years, system software that fully utilizes all 
their features is still not available, mostly due to the complexity and cost of 
making the required changes to the operating system. A recently proposed 
approach, called Disco, substantially reduces this development cost by using a 
virtual machine monitor that leverages the existing operating system technology. In 
this paper we present a syste ... 
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December 1993 ACM SIGOPS Operating Systems Review , Proceedings of the 
fourteenth ACM symposium on Operating systems principles, 

Volume 27 Issue 5 

Full text available- fP) pdf(1 47 MB) Additional Information: fu!!.citati?.n, attract, references, cjtings, index 
' terms 

In this paper we describe an architecture for extensible file systems. The 
architecture enables the extension of file system functionality by composing (or 
stacking) new file systems on top of existing file systems. A file system that is 
stacked on top of an existing file system can access the existing file system's files 
via a well-defined naming interface and can share the same underlying file data in 
a coherent manner. We describe extending file systems in the context of the 
Spring operating ... 

11 Jh§..DCM: a ha^ Q 
Robert Edward Boring 

October 1985 Proceedings of the 1985 ACM annual conference on The range of 
computing : mid-80's perspective: mid-80's perspective 

Full text available: Wi pdf(685.97 KB) Additional Information: full citation , references , index terms 



12 Robustness: Using model checking to debug device firmware I I 
Sanjeev Kumar, Kai Li 

December 2002. ACM SIGOPS Operating Systems Review, Volume 36 Issue SI 
Full text available: ^pdf(l : 72Jtf B) Additional Information: fu^citation, abstract, references 

Device firmware is a piece of concurrent software that achieves high performance 
at the cost of software complexity. They contain subtle race conditions that make 
them difficult to debug using traditional debugging techniques. The problem is 
further compounded by the lack of debugging support on the devices. This is a 
serious problem because the device firmware is trusted by the operating 
system. Model checkers are designed to systematically verify properties of 
concurrent systems. Therefore, mod ... 

13 A microarchitecture description language for retargeting firmware tools Q 
J. F. Nixon, S. R. Schach, R. I. Winner 

December 1986 ACM SIGMICRO Newsletter , Proceedings of the 19th annual 

workshop on Microprogramming, Volume 17 Issue 4 
Full text available: f ^pdf(1.02 MB) Additional Information: full, citation, abstract, references, citings, index 
^ terms 

ARCHI is a microarchitecture description language designed to serve as the 
underlying language for a retargetable firmware development environment. ARCHI 
provides a hierarchical, procedural description at the register transfer level of the 
target microarchitecture. This description can then be utilized by a 
meta-environment to generate an instantiation of the firmware development 
environment for a specific target microarchitecture. ARCHI has been successfully 
used to generate deb ... 

Chiaki Ishikawa, Ken Sakamura, Mamoru Maekawa 

December 1981 ACM SIGMICRO Newsletter , Proceedings of the 14th annual 

workshop on Microprogramming, Volume 12 Issue 4 
Full text available: ^ pdf(878.57 KB) Additional Information: full citation , abstract , references , index terms 



4 of 6 



6/14/05 3:53 PM 



Results (page 1): vmm efi extensible firmware 



http://portal.acm.org/resdtsxfm?coll=ACM&dl=ACM&CFID=47. 



This paper discusses the important problem of the adaptation and the 
personalization of VLSI-based computer systems by means of microprogramming. 
In order to fully exploit the VLSI chip with one million transistors on it, we propose 
the scheme of mass-producing the general purpose devices and then specializing 
them to particular application needs at the instruction set processor level and at 
upper levels of the computer systems. Many algorithms and examples are given. 
We show what facilitie ... 

Keywords: Adaptation and personalization, Application-oriented architecture, 
Tuning of computer systems, VLSI-based microprogrammed computer architecture 

15 A firmware monitor to support vertical migration decisions in the UNIX I I 
B. Holtkamp, H. Kaestner 

October 1982 ACM SIGMICRO Newsletter , Proceedings of the 15th annual 

workshop on Microprogramming, Volume 13 Issue 4 
Full text available* fll£df(686 64 KB) Additional Information: full citation , abstract , references , citings , index 
terms 

From a methodological point of view vertical migration involves the following four 
steps: identification of migration objects, prediction of expected system 
improvements, implementation, and verification of the results. In this paper a 
firmware monitor is presented as a support tool for the first and fourth step. The 
application environment for this monitor is a PDP-11/60 with writable control store 
running the UNIX operating system. Based upon a UNIX system model the 
requirement ... 

16 Firmware factory & forth I I 
Brad Eckert 

December 1999 ACM SIGPLAN Notices, Volume 34 Issue 12 

Full text available: ^.pdf(373.72 KB) Additional Information: fu!!.citation, citings, index terms 



17 An eng^ Q 
D. Becker, R. K. Singh, S. G. fell 

July 1992 Proceedings of the 29th ACM/IEEE conference on Design 
automation 

Full text available: ^ pdf(583.00 KB) Additional Information: full citation , references , citings , index terms 



18 VHDL and Ada firmware □ 
D. L. Barton 

July 1989 Proceedings of the conference on TRI-Ada '88 

Full text available: ^p.p3{414;23„KB) Additional Information: fuILcitation, index jerms 



19 Heart: An operating system nucleus machine implemented by firmware 
N. Kamibayashi, H. Ogawana, K. Nagayama, H. Aiso 

March 1982 Proceedings of the first international symposium on Architectural 
support for programming languages and operating systems, 

Volume 17 , 10 Issue 4 , 2 
Full text available: ^.pdf[ZilJ6. KB) Additional Information: full citation, abstract, references, index terms 
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This paper discusses the role of microprogramming in operating system design and 
shows several things: (1) advantages of the efficiency which may be gained from 
microcoded operating system primitives, (2) selecting the most appropriste 
primitives for implementation, and (3) an analysis of the tradeoffs among software, 
firmware, and hardware. The authors propose a practical approach of enhancing 
computer architecture level, from a view point of functional hierarchy of operating 
systems. In o ... 

20 The implementation of a user-extensible system on a dynamically I I 

microprogrammable computer 
Fergus K. Fung, Willis K. King 

September 1977 ACM SIGMICRO Newsletter , Proceedings of the 10th annual 

workshop on Microprogramming, Volume 8 Issue 3 
Full text available: ^£^{308.59 KB) Additional Information: M.crtatiqn p aMfact, references, index terms 

On a dynamically user-microprogrammable computer the user can tailor the 
machine to his needs by constructing microprogrammed routines and adding them 
to the system. If these routines are recognized by the assembler, then using them 
is no different from using any other basic machine instruction of the computer. The 
base machine is thus extended. The design and implementation of such a 
user-extensible system is described. It consists of 2 main parts: a pager which 
manages a virtual memory fo ... 
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1 Virtual memory mapped network interface for the SHRIMP multicomputer 
Matthias A. Blumrich, Kai Li, Richard Alpert, Cezary Dubnicki, Edward W. Felten, 
Jonathan Sandberg 

August 1998 25 years of the international symposia on Computer architecture 
(selected papers) 

Full text available: f| 3 pdf(1.39 MB) Additional Information: full citation , references , index terms 



2 Parallel text retrieval on a high performance supercomputer using the Vector 
Space Model 

P. Efraimidis, C. Glymidakis, B. Mamalis, P. Spirakis, B. Tampakas 
July 1995 Proceedings of the 18th annual international ACM SIGIR conference 
on Research and development in information retrieval 

Full text available: W\ pdf(959.41 KB) Additional Information: full citation , references , index terms 



3 Using history of computing to address problems and opportunities 
Orit Hazzan, John Impagliazzo, Raymond Lister, Shimon Schocken 
February 2005 ACM SIGCSE Bulletin , Proceedings of the 36th SIGCSE technical 

symposium on Computer science education, Volume 37 Issue 1 
Full text available: ^.pdf[.l.§9 -44JKJB) Additional Information: MLSlMSfl. leMe.Qces, index terms 



Keywords: CS education research, computer science, computer science education, 
history of computing, software engineering 
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5 The omni-directional treadmill: a locomotion device for virtual worlds 
Rudolph P. Darken, William R. Cockayne, David Carmein 
October 1997 Proceedings of the 10th annual ACM symposium on User 
interface software and technology 

Full text available: f| 3 pdf(1.30 MB) Additional Information: full citation , references , citings , index terms 



Keywords: exertion devices, input devices, locomotion, maneuvering, virtual 
environments, virtual reality 



6 An Abstract Machine for Symbolic Computation | 
D. L. Overheu 

July 1966 Journal of the ACM (JACM), Volume 13 Issue 3 

Full text available: ^pjff&MM?.) Additional Information: fuJLcitatiqn, abstract , references, index terms 

The design of an abstract machine with a recursive function programming language 
which avoids the predicate type of conditional is described. It is shown that trough 
the adoption of list processing techniques it has been possible to construct a simple 
simulator for the machine in FORTRAN. A program for the machine which causes it 
to perform symbolic differentiation with some algebraic manipulation of the 
expressions concerned is given as an example of the type of computations which 
may be p ... 

7 Intersecting solids on a massively paralle l processor 
Michael Karasick, David Strip 

January 1995 ACM Transactions on Graphics (TOG), Volume 14 Issue 1 

Full text available* W[ pdf(2 36 MB) Additional Information: full citation , abstract , references , index terms , 

Solid modeling underlies many technologies that are key to modern manufacturing. 
These range from CAD systems to robot simulators, from finite-element analysis to 
integrated circuit process modeling. The accuracy, and hence the utility, of these 
models is often constrained by the amount of computer time required to perform 
the desired operations. In this paper we present, in detail, an efficient algorithm 
for parallel intersections of solids using the Connection Machine, a massively 
paralle ... 

Keywords: CAGD, grid data, hierarchical descriptions, spline surfaces, tensor 
product 



8 UsinjajexicaJ Q 
Anne Abeilie, Yves Schabes, Aravind K. Joshi 

August 1990 Proceedings of the 13th conference on Computational linguistics - 
Volume 3 

Full text available: pdf(569.06 KB) Additional Information: full citation , abstract , references , citings 

Lexicalized Tree Adjoining Grammar (LTAG) is an attractive formalism for linguistic 
description mainly because of its extended domain of locality and its factoring 
recursion out from the domain of local dependencies (Joshi, 1985, Kroch and Joshi, 
1985, Abeilie, 1988). LTAG's extended domain of locality enables one to localize 
syntactic dependencies (such as filler-gap), as well as semantic dependencies 
(such as predicate-arguments). The aim of this paper is to show that these 
properties c ... 
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Helper threads via virtual multithreading on an experimental itanium- 2 
processqr-based pjatform 

Perry H. Wang, Jamison D. Collins, Hong Wang, Dongkeun Kim, Bill Greene, Kai-Ming 
Chan, Aamir B. Yunus, Terry Sych, Stephen F. Moore, John P. Shen 
October 2004 Proceedings of the 11th international conference on 

Architectural support for programming languages and operating 

systems, Volume 39 , 38 , 32 Issue 11,5,5 
Full text available: ^p.^(225^7.KB} Additional Information: MLcitation, abstract, references^ index. terms 

Helper threading is a technology to accelerate a program by exploiting a 
processor's multithreading capability to run ' "assist" threads. Previous 
experiments on hyper-threaded processors have demonstrated significant 
speedups by using helper threads to prefetch hard-to-predict delinquent data 
accesses. In order to apply this technique to processors that do not have built-in 
hardware support for multithreading, we introduce virtual multithreading (VMT), a 
novel form of switch-on-event user-level ... 

Keywords: DB2 database, PAL, cache miss prefetching, helper thread, itanium 
processor, multithreading, switch-on-event 



10 Background execution of time warp programs 
Christopher D. Carothers, Richard M. Fujimoto 

July 1996 ACM SIGSIM Simulation Digest , Proceedings of the tenth workshop 

on Parallel and distributed simulation, Volume 26 Issue 1 
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Vicki de Mey, Simon Gibbs 

September 1993 Proceedings of the first ACM international conference on 
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April 1992 Proceedings of the 1992 ACM/SIGAPP Symposium on Applied 
computing: technological challenges of the 1990 s 

Full text available: ^ pdf(729.10 KB) Additional Information: full citation , references , index terms 
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S. A. Felperin, L. Gravano, G. D. Pifarre, J. L C. Sanz 
August 1991 Proceedings of the 1991 ACM/IEEE conference on 
Supercomputing 

Full text available: pdf(978.41 KB) Additional Information: full citation , references , citings , index terms 



is Ada packages and distributed systems 
Warren H. Jessop 

February 1982 ACM SIGPLAN Notices, Volume 17 Issue 2 
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16 Virtual machine monitors: Terra: a virtual machine-based platform for trusted Q 
computing 

Tal Garfinkel, Ben Pfaff, Jim Chow, Mendel Rosenblum, Dan Boneh 
October 2003 Proceedings of the nineteenth ACM symposium on Operating 
systems principles 

Full text available - f ®\ pdf(140 31 KB) Additiona l Information: fun.citation, abstract, references, citings, index 
Ix^h_j : terms 

We present a flexible architecture for trusted computing, called Terra, that allows 
applications with a wide range of security requirements to run simultaneously on 
commodity hardware. Applications on Terra enjoy the semantics of running on a 
separate, dedicated, tamper-resistant hardware platform, while retaining the 
ability to run side-by-side with normal applications on a general-purpose 
computing platform. Terra achieves this synthesis by use of a trusted virtual 
.machine monitor (TVMM ... 

Keywords: VMM, attestation, authentication, trusted computing, virtual machine, 
virtual machine monitor 



17 Virtual machine monitors: Xen and the art of virtualization Q 
Paul Barham, Boris Dragovic, Keir Fraser, Steven Hand, Tim Harris, Alex Ho, Rolf 
Neugebauer, Ian Pratt, Andrew Warfield 

October 2003 Proceedings of the nineteenth ACM symposium on Operating 
systems principles 

Full text available- pdf(168 76 KB) A^^ 0 " 3 ' Information: full citation , abstract , references , citings , index 
^ J£ ~ J terms 

Numerous systems have been designed which use virtualization to subdivide the 
ample resources of a modern computer. Some require specialized hardware, or 
cannot support commodity operating systems. Some target 100% binary 
compatibility at the expense of performance. Others sacrifice security or 
functionality for speed. Few offer resource isolation or performance guarantees; 
most provide only best-effort provisioning, risking denial of service.This paper 
presents Xen, an x86 virtual machine monit ... 

Keywords: hypervisors, paravirtualization, virtual machine monitors 
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18 The design of a virtual machine for Ada 
L. J. Groves, W. J. Rogers 

November 1980 ACM SIGPLAN Notices , Proceeding of the ACM-SIGPLAN 

symposium on Ada programming language, Volume 15 Issue 11 
Full text available: * ^pdf(127 MB) Additional Information: full citation , abstract, references 

An implementation of Ada should be based on a machine-independent translator 
generating code for a Virtual Machine, which can be realised on a variety of 
machines. This approach, which leads to a high degree of compiler portability, has 
been very successful in a number of recent language implementation projects and 
is the approach which has been specified by the U. S. Army and Air Force in their 
requirements for Ada implementations.This paper discusses the rationale, 
requirements and design of s ... 

19 The case for virtual register machines 

Brian Davis, Andrew Beatty, Kevin Casey, David Gregg, John Waldron 
June 2003 Proceedings of the 2003 workshop on Interpreters, virtual 
machines and emulators 

Full text available: ^.{^2^.65 KB) Additional Information: Ml citation, abstract, references, jndexjerms 

Virtual machines (VMs) are a popular target for language implementers. 
Conventional wisdom tells us that virtual stack architectures can be implemented 
with an interpreter more efficiently, since the location of operands is implicit in the 
stack pointer. In contrast, the operands of register machine instructions must be 
specified explicitly. In this paper, we present a working system for translating 
stack-based Java virtual machine (JVM) code to a simple register code. We 
describe the translatio ... 

Keywords: interpreter, register architecture, stack architecture, virtual machine 



20 Virtual machines: ReVirt: enabling intrusion analysis through virtual-machine 
logging and replay 

George W. Dunlap, Samuel T. King, Sukru Cinar, Murtaza A. Basrai, Peter M. Chen 
December 2002 ACM SIGOPS Operating Systems Review, Volume 36 Issue SI 

Full text available: l |||pdf( 156 MB) Additional Information: fuJLdtation, abstract, references, citings 

Current system loggers have two problems: they depend on the integrity of the 
operating system being logged, and they do not save sufficient information to 
replay and analyze attacks that include any non-deterministic events. ReVirt 
removes the dependency on the target operating system by moving it into a virtual 
machine and logging below the virtual machine. This allows ReVirt to replay the 
system's execution before, during, and after an intruder compromises the system, 
even if the intruder rep ... 
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1 Helper threads via virtual multithreading on an experimental itanium® 2 
Processor-b^ 

Perry H. Wang, Jamison D. Collins, Hong Wang, Dongkeun Kim, Bill Greene, Kai-Ming 
Chan, Aamir B. Yunus, Terry Sych, Stephen F. Moore, John P. Shen 
October 2004 Proceedings of the 11th international conference on 

Architectural support for programming languages and operating 

systems, Volume 39 , 38 , 32 Issue 11,5,5 
Full text available: ^.p.^(225.47.KB) Additional Information: MLc[tatio.Q. abstract, references^ 

Helper threading is a technology to accelerate a program by exploiting a 
processor's multithreading capability to run * x assist" threads. Previous 
experiments on hyper-threaded processors have demonstrated significant 
speedups by using helper threads to prefetch hard-to-predict delinquent data 
accesses. In order to apply this technique to processors that do not have built-in 
hardware support for multithreading, we introduce virtual multithreading (VMT), a 
novel form of switch-on-event user-level ... 

Keywords: DB2 database, PAL, cache miss prefetching, helper thread, itanium 
processor, multithreading, switch-on-event 



2 Prefetch injection based on hardware monitoring and object metadata 
Ali-Reza Adl-Tabatabai, Richard L. Hudson, Mauricio J. Serrano, Sreenivas 
Subramoney 

June 2004 ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN 2004 

conference on Programming language design and implementation, 

Volume 39 Issue 6 

Full text available: ^| |pdf(288.00 KB) Additional Information: full citation , abstract , references , index terms 

Cache miss stalls hurt performance because of the large gap between memory and 
processor speeds - for example, the popular server benchmark SPEC JBB2000 
spends 45% of its cycles stalled waiting for memory requests on the Itanium® 2 
processor. Traversing linked data structures causes a large portion of these stalls. 
Prefetching for linked data structures remains a major challenge because serial 
data dependencies between elements in a linked data structure preclude the timely 
materialization ... 

Keywords: cache misses, compiler optimization, garbage collection, prefetching, 
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3 Lock reservation: Java locks can mostly do without atomic operations ^ 
Kiyokuni Kawachiya, Akira Koseki, Tamiya Onodera 

November 2002 ACM SIGPLAN Notices , Proceedings of the 17th ACM SIGPLAN 
conference on Object-oriented programming, systems, 
languages, and applications, Volume 37 Issue 11 

Full text available- f|) pdf(246 29 KB) Additional Information: full citation , abstract , references , citings , index 
^ terms 

Because of the built-in support for multi-threaded programming, Java programs 
perform many lock operations. Although the overhead has been significantly 
reduced in the recent virtual machines, One or more atomic operations are 
required for acquiring and releasing an object's lock even in the fastest cases.This 
paper presents a novel algorithm called lock reservation. It exploits thread locality 
of Java locks, which claims that the locking sequence of a Java lock contains a very 
Ion ... 

Keywords: Java, atomic operation, lock, monitor, reservation, synchronization, 
thread locality 



M. Anton Ertl, David Gregg 

May 2003 ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN 2003 

conference on Programming language design and implementation, 

Volume 38 Issue 5 

Full text available: "HfidfJ 1 90.05 KB) Addrtional Information: fuHcitation, abstract, references, index terms, 
^ review 

Interpreters designed for efficiency execute a huge number of indirect branches 
and can spend more than half of the execution time in indirect. branch 
mispredictions. Branch target buffers are the best widely available form of indirect 
branch prediction; however, their prediction accuracy for existing interpreters is 
only 2%-50%. In this paper we investigate two methods for improving the 
prediction accuracy of BTBs for interpreters: replicating virtual machine (VM) 
instructions and combining seq ... 

Keywords: branch prediction, branch target buffer, code replication, interpreter, 
superinstruction 



5 Pinpointing Representative Portions of Large Intel© Itanium© Programs with Q 
Dynamic Instrumentation 

Harish Patil, Robert Cohn, Mark Charney, Rajiv Kapoor, Andrew Sun, Anand 
Karunanidhi 

December 2004 Proceedings of the 37th annual International Symposium on 
Microarchitecture 

Full text available: *| | pdf(224.59 KB) Additional Information: full citation , abstract 

Detailed modeling of the performance of commercial applications is difficult. The 
applications can take a very long time to run on real hardware and it is impractical 
to simulate them to completion on performance models. Furthermore, these 
applications have complex execution environments that cannot easily be 
reproduced on a simulator, making porting the applications to simulators difficult. 
We attack these problems using the well-known SimPoint methodology to find 
representative portions of an ... 



2 of 7 



6/14/05 3:56 PM 



Results (page 1): virtual machine monitor itanium 



http://portal.acnLorg/resultsxfm?coll=ACM&dl=ACM&CFro 



6 Field-testing IMPACT EPIC research results in Itanium 2 

John W. Sias, Sain-zee Ueng, Geoff A. Kent, Ian M. Steiner, Erik M. Nystrom, 
Wen-mei W. Hwu 

March 2004 ACM SIGARCH Computer Architecture News , Proceedings of the 
31st annual international symposium on Computer architecture 
Volume 00, Volume 32 Issue 2 

Full text available: ^£df(273,71 KB) Additional Information: MLcjtation, abstract 

Explicitly-Parallel Instruction Computing (EPIC) providesarchitectural features, 
including predication and explicitcontrol speculation, intended to enhance the 
compiler'sability to expose instruction-level parallelism (ILP) incontrol-intensive 
programs. Aggressive structural transformationsusing these features, though 
described in theliterature, have not yet been fully characterized in 
completesystems. Using the Intel Itanium 2 microprocessor,the SPECint2000 
benchmarks and the IMPACT Compilerfor ... 

Itanium© 2 Processor 
Jean-Francois Collard, Daniel Lavery 

March 2003 Proceedings of the international symposium on Code generation 

and optimization: feedback-directed and runtime optimization 

Full text available: f!3 pdf(1 .28 MB) m . . 

Ml * Additional Information: MLQrtation, abstract, references, .index terms 

This paper describes scheduling optimizations in the Intel® Itanium® compiler to 
prevent cache penalties due to various micro-architectural effects on the Itanium 2 
processor. This paper does not try to improve cache hit rates but to avoid 
penalties, which probably all 'processors have in one form or another, even in the 
case of cache hits. These optimizations make use of sophisticated methods for 
disambiguation of memory references, and this paper examines the performance 
improvement obt ... 

8 The Performance of Runtime Data Cache Prefetching in a Dynamic [jgj 

jiwei Lu, Howard Chen, Rao Fu, Wei-Chung Hsu, Bobbie Othmer, Pen-Chung Yew, 
Dong-Yuan Chen 

December 2003 Proceedings of the 36th annual IEEE/ACM International 
Symposium on Microarchitecture 

Full text available: ^.p.df{253,79 KB) Additional Information: Ml citation, abstract, citings, index terms 

Traditional software controlled data cache prefetching isoften ineffective due to the 
lack of runtime cache miss andmiss address information. To overcome this 
limitation, weimplement runtime data cache prefetching in the 
dynamicoptimization system ADORE (ADaptive Object code RE-optimization).Its 
performance has been compared withstatic software prefetching on the SPEC2000 
benchmarksuite. Runtime cache prefetching shows better performance. On an 
Itanium 2 based Linux workstation, it can increasepe ... 

Ruben Pinilla, Marisa Gil 

October 2003 ACM SIGOPS Operating Systems Review, Volume 37 Issue 4 
Full text available: ^pdf(3.39 MB) Additional Information: full citation , abstract , references 

Java is known to be a valuable technology for building platform independent 
applications, based on an independent execution environment provided by a virtual 
machine (JVM, Java Virtual Machine) and an API formed by a set of classes. The 
Java platform was conceived as a solution for application transportation between 
heterogeneous platforms without the need of adapting and recompiling the source 
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code. Some previous analyses of Sun JVM implementation (Java 2 SDK 1.2.2-006) 
establish that the HPI ( ... 

Keywords: HPI, JVM, Java threads, ULT, concurrency, kernel threads, 
multithreaded, scheduling, user threads 



10 Compilation: The impact of if-conversion and branch prediction on program Q 
execution on the Intel® Itanium™ processor 

Youngsoo Choi, Allan Knies, Luke Gerke, Tin-Fook Ngai 

December 2001 Proceedings of the 34th annual ACM/IEEE international 

symposium on Microarchitecture 

Full text available: f!3pdf(1.06 MB) 

Jlf" Additional Information: full citation, abstract, references, citings 

™PyM!sherS(te 

The research community has studied if-conversion for many years. However, due 
to the lack of existing hardware, studies were conducted by simulating code 
generated by experimental compilers. This paper presents the first comprehensive 
study of the use of predication to implement if-conversion on production hardware 
with a near-production compiler. To better understand trends in the 
measurements, we generated binaries at three increasing levels of if-conversion 
aggressiveness. For each level, we ... 

11 A proposal for a new hardware cache monitoring architecture Q 
Martin Schulz, Jie Tao, Jurgen Jeitner, Wolfgang Karl 

June 2002 ACM SIGPLAN Notices , Proceedings of the workshop on Memory 

system performance, Volume 38 Issue 2 supplement 
Full text available: ^ pdf(1.23 MB) Additional Information: full citation , abstract , references 

The analysis of the memory access behavior of applications, an essential step for a 
successful cache optimization, is a complex task. It needs to be supported with 
appropriate tools and monitoring facilities. Currently, however, users can only rely 
on either simulation based approaches, which deliver a large degree of detail but 
are restricted in their applicability, or on hardware counters embedded into 
processors, which allow to keep track of very few, mostly global events and hence 
only provi ... 

12 Virtual machine monitors: Terra: a virtual machine-based pla tform for trusted Q 
computing 

Tai Garfinkel, Ben Pfaff, Jim Chow, Mendel Rosenblum, Dan Boneh 
October 2003 Proceedings of the nineteenth ACM symposium on Operating 
systems principles 

Full text available- If) £df(140 31 KB) AdditionaI Information: full citation , abstract , references , citings , index 
^ " terms 

We present a flexible architecture for trusted computing, called Terra, that allows 
applications with a wide range of security requirements to run simultaneously on 
commodity hardware. Applications on Terra enjoy the semantics of running on a 
separate, dedicated, tamper-resistant hardware platform, while retaining the 
ability to run side-by-side with normal applications on a general-purpose 
computing platform. Terra achieves this synthesis by use of a trusted virtual 
machine monitor (TVMM ... 

Keywords: VMM, attestation, authentication, trusted computing, virtual machine, 
virtual machine monitor 
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14 Bytecode fetch optimization for a Java interpreter 
Kazunori Ogata, Hideaki Komatsu, Toshio Nakatani 

October 2002 Proceedings of the 10th international conference on 

Architectural support for programming languages and operating 
systems, Volume 36 , 37 , 30 Issue 5 , 10 , 5 

Full text available - ^ pdf(1.16 MB) Additional Information: Mlcitation, abstract, references, citings, index 

terms 

Interpreters play an important role in many languages, and their performance is 
critical particularly for the popular language Java. The performance of the 
interpreter is important even for high-performance virtual machines that employ 
just-in-time compiler technology, because there are advantages in delaying the 
start of compilation and in reducing the number of the target methods to be 
compiled. Many techniques have been proposed to improve the performance.of 
various interpreters, but none of ... 

Keywords: Java, PowerPC, bytecode interpreter, performance, pipelined 
interpreter, stack caching, superscalar processor 

15 An implementation scheme for a virtual machine monitor to be realized on user Q 
- microprogrammable minicomputers 

B. D. Shriver, J. W. Anderson, L J. Waguespack, D. M. Hyams, R. A. Bombet 
October 1976 Proceedings of the annual conference 

Full text available: W\ pdf(654.60 KB) Additional Information: fu|[ cM9.D, attract, references, citings, index 
^ terms 

A virtual machine monitor allows several different operating systems to run 
concurrently on the same machine. This paper presents the description of a virtual 
machine monitor and its support structure which can be implemented on a 
microprogrammable minicomputer or a distributed network of such machines. In 
our approach, all storage, transformational, input, and output resources of the 
system are accessed through a mapping mechanism. The design and 
implementation methodology for an actual re ... 

1 6 IM Iocs JesUHjame ss and. Report^ 

Shava Smalien, Catherine Olschanowsky, Kate Ericson, Pete Beckman, Jennifer M. 
Schopf 

November 2004 Proceedings of the 2004 ACM/IEEE conference on 
Supercomputing 

Full text available: ^.p.df(3.01^MB) Additional Information: MLcitation, abstract 

Virtual organizations (VOs), communities that enable coordinated resource sharing 
among multiple sites, are becoming more prevalent in the high-performance 
computing community. In order to promote cross-site resource usability, most VOs 
prepare service agreements that include a minimum set of common resource 
functionality, starting with a common software stack and evolving into more 
complicated service and interoperability agreements. VO service agreements are 
often difficult to verify and maint ... 
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17 OS Debugging Method Using a Lightweight Virtual Machine Monitor 
Tadashi Takeuchi 

March 2005 Proceedings of the conference on Design, Automation and Test in 
Europe - Volume 2 

Full text available: ^ pdf(85.92 KB) Additional Information: full citation , abstract 

Demands for implementing original OSs that can achieve high I/O performance on 
PC/AT compatible hardware have recently been increasing, but conventional OS 
debugging environments have not been able to simultaneously assure their 
stability, be easily customized to new OSs and new I/O devices, and assure 
efficient execution of I/O operations. We therefore developed a novel OS 
debugging method using a lightweight virtual machine. We evaluated this 
debugging method experimentally and confirmed that ... 

18 Virtual machine monitors: Xen and the art of virtualization 

Paul Barham, Boris Dragovic, Keir Fraser, Steven Hand, Tim Harris, Alex Ho, Rolf 
Neugebauer, Ian Pratt, Andrew Warfield 

October 2003 Proceedings of the nineteenth ACM symposium on Operating 
systems principles 

Full text available ^ pdf(1 68 76 KB) Additiona ' Information: full citation , abstract , references , citings , index 

Numerous systems have been designed which use virtualization to subdivide the 
ample resources of a modern computer. Some require specialized hardware, or 
cannot support commodity operating systems. Some target 100% binary 
compatibility at the expense of performance. Others sacrifice security or 
functionality for speed. Few offer resource isolation or performance guarantees; 
most provide only best-effort provisioning, risking denial of service.This paper 
presents Xen, an x86 virtual machine monit ... 

Keywords: hypervisors, paravirtualization, virtual machine monitors 



19 Track 7: compilers and operating systems: Dynamic run-time architecture Q 
techniques for enabling co optimization 

Tipp Moseley, Alex Shye, Vijay Janapa Reddi, Matthew Iyer, Dan Fay, David Hodgdon, 

Joshua L. Kihm, Alex Settle, Dirk Grunwald, Daniel A. Connors 

May 2005 Proceedings of the 2nd conference on Computing frontiers 

Full text available: ^ pdf(347.97 KB) Additional Information: full citation , abstract , references , index terms 

Future computer systems will integrate tens of multithreaded processor cores on a 
single chip die, resulting in hundreds of concurrent program threads sharing 
system resources. These designs will be the cornerstone of improving throughput 
in high-performance computing and server environments. However, to date, 
appropriate systems software (operating system, run-time system, and compiler) 
technologies for these emerging machines have not been adequately explored. 
Future processors will require so ... 

Keywords: multithreading, performance counters, profiling, scheduling 



20 Compilers II: Inter-procedural stacked register allocation for itanium® like 
architecture 

Liu Yang, Sun Chan, G. R. Gao, Roy Ju, Guei-Yuan Lueh, Zhaoqing Zhang 
June 2003 Proceedings of the 17th annual international conference on 
Supercomputing 

Full text available: *j| |pdf(478.20 KB) Additional Information: full citation , abstract , references , index terms 
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A hardware managed register stack, Register Stack Engine (RSE), is implemented 
in Itanium® architecture to provide a unified and flexible register structure to - 
software. The compiler allocates each procedure a register stack frame with its size 
explicitly specified using an alloc instruction. When the total number of registers 
used by the procedures on the call stack exceeds the number of physical registers, 
RSE performs automatically register overflows and fills to ensure that the c ... 

Keywords: hot region, hotspot, inter-procedural stacked register allocation, quota 
assignment, register allocation 
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All slide shows are in Adobe* Acrobat* format. 
Download Adobe Acrobat Reader 
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• EFI and Windows "Longhorn" - Microsoft Corporation 
WMV Downloads: part 1 part 2 part 3 part 4 

• Next Generation EFI Operating System "Linux & EFI" - Intel Corporation 
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